Automatic term acquisition from domain-specific text collection by using Wikipedia
نویسندگان
چکیده
منابع مشابه
Exploiting Wikipedia to Identify Domain-Specific Key Terms/Phrases from a Short-Text Collection
Extracting from a given document collection what we call “domain-specific” key terms/phrases is a challenging task. By “domainspecific” key terms/phrases we mean words/expressions representative of the topical areas specific to the focus of a document collection. For example, when a collection is related to academic research (i.e., its focus is related to topics dealing with academic research),...
متن کاملDomain-Specific Knowledge Acquisition from Text
In many knowledge intensive applications, it is necessary to have extensive domain-specific knowledge in addition to general-purpose knowledge bases. This paper presents a methodology for discovering domain-specific concepts and relationships in an attempt to extend WordNet. The method was tested on five seed concepts selected from the financial domain: interest rate, stock market, inflation, e...
متن کاملAutomatic Acquisition of Script Knowledge from a Text Collection
In this paper, we describe a method for automatic acquisition of script knowledge from a Japanese text collection. Script knowledge represents a typical sequence of actions that occur in a particular situation. We extracted sequences (pairs) of actions occurring in time order from a Japanese text collection and then chose those that were typical of certain situations by ranking these sequences ...
متن کاملDomain Specific Automatic Question Generation from Text
The goal of my doctoral thesis is to automatically generate interrogative sentences from descriptive sentences of Turkish biology text. We employ syntactic and semantic approaches to parse descriptive sentences. Syntactic and semantic approaches utilize syntactic (constituent or dependency) parsing and semantic role labeling systems respectively. After parsing step, question statements whose an...
متن کاملMining for Domain-specific Parallel Text from Wikipedia
Previous attempts in extracting parallel data from Wikipedia were restricted by the monotonicity constraint of the alignment algorithm used for matching possible candidates. This paper proposes a method for exploiting Wikipedia articles without worrying about the position of the sentences in the text. The algorithm ranks the candidate sentence pairs by means of a customized metric, which combin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the Institute for System Programming of RAS
سال: 2014
ISSN: 2079-8156,2220-6426
DOI: 10.15514/ispras-2014-26(4)-1